Slave Web Card Server #1 
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Web Server 
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Local Card 
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Slave Web Card Server #N 

(maybe operated by other ISP/ICP,)itc 
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3721™ Master Sewer 




372F Master 
Web Server 
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372F MasterCard 
Search Engine 



Master Card 
Database 



Connect to Internet 
through Firewall 
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Web Card Server in Intranet A 

(owned by company )k 
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Web Server 



Connect to Internet 
through Firewall 



Web Card Server in Intranet B 

(owned by company B) 
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public on Internet: 



Local Card 
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public on Internet 
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Memory structure(arrayjink list^tc) 
to store web cards data 



Hash table index entry for name search 
(64K Unicode Chars) 
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a list of pointer, each points to 
a card whose name is 

Hash table index entry for Finybi and 
Tongyin search 
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wang 




zhang 
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Peng 



a list of pointer, each points to a card whose 
name has the prounoucation 'li wei fen£ 



FIG 3A 



Memory structure(arrayJLi]ik list^tc) 

to store web cards data 

(the same structure as Fig3A) 



Hash table index entry for fuzzy search 
(64K Unicode Chars) 
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a list of pointer, each points to a card 
whose name or additional infomation 
include the word ' S R ®^ ' 
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J User input query string A 
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name search 



phonetic or homophony search 



Treat A as a Chinese chars string, 
then get it's phonetic spelling. 




homophony search 



Treat A directly as a phonetic 
spelling string. 
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phonetic search 



Parse the Hash table index tree(the upper I s CO 
onemFig3A)tobcateanodeNa.The JC* ^ 
path from the root of tree to Na exactly 
match string A. 
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Phonetic spelling A' 



Parse the Hash table index tree(the bwer 
one in Fig3A) to bcate anode Nb. The 
path from the root of tree to Nb will have 
a phonetic spelling matching string A'. 



Get the pointer list of node Na as the 
result Ra, Ra then will contain all the 
cards whose name exactly match A 
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j/ Resuh Ra 
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Get the pointer list of node Nb as the 
result Rb, Rb then will contain all the 
cards whose name's phonetic spelling 
exactly match A. 
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Result Rb 
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Combine set Ra and Rb to get the final 
result R. 
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Final result R 
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FIG 4 A 




Self-learning Chinese 
^ wordcktabase^ 
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Split A to one or more Chinese words: 
W={W^ 2 ,..,W H } 
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For each word W s in W, parse the Hash table 
index tree in Fig3B to bcate a node N x . The 
path from the root of tree to 1^ exactly match 
ward W 
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Get the pointer list of node 1^ as the result 
lii 1 x then will contain all the cards 
whose name or additional information include 
the word W„. 
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Get the union of R..R. „: 
R = R J UR 3 U...R H 
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While doing the unoin, calculate the weight of each result in R according to 
specified rules,such as: 

(1) weight of count: the number of words within W that the result contains. 

(2) weight of length: the total length of words within W that the result contains. 

(3) weight of word attribute: assign each word attribute (noun, verb, adj.,etc) a 
individual weight, then we can get a comprehensive weight. 

Finally, calculate the comprehensive weight of each result based on the above 
rules. 
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Sort the result list R according to weight of 
result, so as the most approximate result s 
appear at head of the list, and limit the 
number of result in the list 

, 7 - 

/Final result list R 
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FIG 4B 



